Serveur d'exploration sur la recherche en informatique en Lorraine

Attention, ce site est en cours de développement !
Attention, site généré par des moyens informatiques à partir de corpus bruts.
Les informations ne sont donc pas validées.

Acoustic-Phonetic Decoding : An Important Issue in Continuous Speech Recognition

Identifieur interne : 00D360 ( Main/Exploration ); précédent : 00D359; suivant : 00D361

Acoustic-Phonetic Decoding : An Important Issue in Continuous Speech Recognition

Auteurs : Jean-Paul Haton [France]

Source :

RBID : CRIN:haton92g

English descriptors

Abstract

The acoustic-phonetic decoding of speech (i.e. the transformation of the acoustic continuum of the speech signal into a description under the form of discrete, linguistic units) constitutes an important step and a major bottleneck in the process of automatic speech recognition. This paper presents the problem and its difficulties together with the different families of solutions proposed so far. After a recall of the methods based on pattern matching techniques and stochastic models we introduce a class of methods based on artificial intelligence knowledge-based techniques. Such methods make an explicit use of all available types of knowledge that intervene in speech perception. We then present the use of neural connectionist models and discuss their interest for the problem. The presentation will be illustrated by practical examples drawn from different systems.


Affiliations:


Links toward previous steps (curation, corpus...)


Le document en format XML

<record>
<TEI>
<teiHeader>
<fileDesc>
<titleStmt>
<title xml:lang="en" wicri:score="593">Acoustic-Phonetic Decoding : An Important Issue in Continuous Speech Recognition</title>
</titleStmt>
<publicationStmt>
<idno type="RBID">CRIN:haton92g</idno>
<date when="1992" year="1992">1992</date>
<idno type="wicri:Area/Crin/Corpus">000F44</idno>
<idno type="wicri:Area/Crin/Curation">000F44</idno>
<idno type="wicri:explorRef" wicri:stream="Crin" wicri:step="Curation">000F44</idno>
<idno type="wicri:Area/Crin/Checkpoint">003617</idno>
<idno type="wicri:explorRef" wicri:stream="Crin" wicri:step="Checkpoint">003617</idno>
<idno type="wicri:Area/Main/Merge">00DC37</idno>
<idno type="wicri:Area/Main/Curation">00D360</idno>
<idno type="wicri:Area/Main/Exploration">00D360</idno>
</publicationStmt>
<sourceDesc>
<biblStruct>
<analytic>
<title xml:lang="en">Acoustic-Phonetic Decoding : An Important Issue in Continuous Speech Recognition</title>
<author>
<name sortKey="Haton, J P" sort="Haton, J P" uniqKey="Haton J" first="J.-P." last="Haton">Jean-Paul Haton</name>
<affiliation>
<country>France</country>
<placeName>
<settlement type="city">Nancy</settlement>
<region type="region" nuts="2">Grand Est</region>
<region type="region" nuts="2">Lorraine (région)</region>
</placeName>
<orgName type="laboratoire" n="5">Laboratoire lorrain de recherche en informatique et ses applications</orgName>
<orgName type="university">Université de Lorraine</orgName>
<orgName type="institution">Centre national de la recherche scientifique</orgName>
<orgName type="institution">Institut national de recherche en informatique et en automatique</orgName>
</affiliation>
</author>
</analytic>
</biblStruct>
</sourceDesc>
</fileDesc>
<profileDesc>
<textClass>
<keywords scheme="KwdEn" xml:lang="en">
<term>acoustic-phonetic decoding</term>
<term>speech recognition</term>
</keywords>
</textClass>
</profileDesc>
</teiHeader>
<front>
<div type="abstract" xml:lang="en" wicri:score="2210">The acoustic-phonetic decoding of speech (i.e. the transformation of the acoustic continuum of the speech signal into a description under the form of discrete, linguistic units) constitutes an important step and a major bottleneck in the process of automatic speech recognition. This paper presents the problem and its difficulties together with the different families of solutions proposed so far. After a recall of the methods based on pattern matching techniques and stochastic models we introduce a class of methods based on artificial intelligence knowledge-based techniques. Such methods make an explicit use of all available types of knowledge that intervene in speech perception. We then present the use of neural connectionist models and discuss their interest for the problem. The presentation will be illustrated by practical examples drawn from different systems.</div>
</front>
</TEI>
<affiliations>
<list>
<country>
<li>France</li>
</country>
<region>
<li>Grand Est</li>
<li>Lorraine (région)</li>
</region>
<settlement>
<li>Nancy</li>
</settlement>
<orgName>
<li>Centre national de la recherche scientifique</li>
<li>Institut national de recherche en informatique et en automatique</li>
<li>Laboratoire lorrain de recherche en informatique et ses applications</li>
<li>Université de Lorraine</li>
</orgName>
</list>
<tree>
<country name="France">
<region name="Grand Est">
<name sortKey="Haton, J P" sort="Haton, J P" uniqKey="Haton J" first="J.-P." last="Haton">Jean-Paul Haton</name>
</region>
</country>
</tree>
</affiliations>
</record>

Pour manipuler ce document sous Unix (Dilib)

EXPLOR_STEP=$WICRI_ROOT/Wicri/Lorraine/explor/InforLorV4/Data/Main/Exploration
HfdSelect -h $EXPLOR_STEP/biblio.hfd -nk 00D360 | SxmlIndent | more

Ou

HfdSelect -h $EXPLOR_AREA/Data/Main/Exploration/biblio.hfd -nk 00D360 | SxmlIndent | more

Pour mettre un lien sur cette page dans le réseau Wicri

{{Explor lien
   |wiki=    Wicri/Lorraine
   |area=    InforLorV4
   |flux=    Main
   |étape=   Exploration
   |type=    RBID
   |clé=     CRIN:haton92g
   |texte=   Acoustic-Phonetic Decoding : An Important Issue in Continuous Speech Recognition
}}

Wicri

This area was generated with Dilib version V0.6.33.
Data generation: Mon Jun 10 21:56:28 2019. Site generation: Fri Feb 25 15:29:27 2022